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DETAILED ACTION 

1 . This action is issued in response to the Amendment filed on 02/12/2007. 

2. Claims 1,3-6, 8 f 15, 17-20, 25, 27 - 31 , 37, 39 - 42, 44, 51 , and 53 - 56 
were amended. Claims 7, and 43 were canceled. No claims were added. 

3. This action is made Final. 

4. Claims 1 - 6, 8 - 42, and 44 - 60 are pending in this application. 

5. Applicant's arguments with respect to amended claims 1,3 — 6,8,15,17 — 20, 
25, 27 - 31 , 37, 39 - 42, 44, 51 , and 53 - 56 have been considered but are moot in 
view of the new ground(s) of rejection. 

Claim Rejections - 35 USC § 103 

6. The following is a quotation of 35 U.S.C. 103(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

7. Claims 1 - 6, 9 -14, 16 - 30, 32 - 42, and 45 - 50, and 52 - 60 are rejected 
under 35 U.S.C. 103(a) as being unpatentable over Broder et al. (Broder hereinafter) 
(US Patent Application Pub No. 2004/0243560 A1 , filed: May 30, 2003) in view of Moffat 
et al. (Moffat hereinafter) (Non-Patent Literature: "Self-Indexing Inverted Files for Fast 
Text Retrieval"; Alistar Moffat, and Justin Zobel; February 1994, 1994 Australian 
Database Conference, and 1994 IEEE Conference on Data Engineering). 
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Regarding Claim 1 and 37, Broder discloses an article comprising a machine- 
readable medium storing instructions operable to cause one or more machines to 
perform operations comprising: 

determining a first value x such that x is determined as an integer corresponding 
to a quantity of documents including at least a majority of the index terms (Page 17 and 
18, [0307] and [0314], lines 1 - 3 and 3 - 9; respectively, Broder 1 ). 

determining a second value y, where y does not exceed x, (Page 15, [0277], 
lines 1 - 2, k be the smallest index, Broder). 

However, Broder does not explicitly disclose that: x is representative of a first 
location of a first skip entry in an inverted index; and that the second value y determined 
as an integer and representative of a second location of a second skip entry in the 
inverted index. On the other hand, Moffat discloses that: x is representative of a first 
location of a first skip entry in an inverted index (Page 14, Section 4.1 : Skipping, 1 st - 3 rd 
paragraph in that page, "if synchronization points-additional locations at which decoding 
can commence - are introduced into the compressed inverted list...", "...To access the 
compressed list to see if document d appears, the first skip is decoded to obtain the 

1 Wherein the top "n" documents corresponds to the documents with the majority of index terms claimed; 
specifically u n" corresponds to the value x claimed. The scoring procedure utilized by Broder (as disclosed 
in detailed in Page 16, [0286], lines 5-6, Broder) including terms associations with upper bounds on its 
maximal contribution to documents scores; wherein the maximal contributing of index terms corresponds 
to the majority of the index terms as claimed. Additionally, the top n scoring documents corresponds to a 
quantity of documents as claimed. 
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address a2 of the second skip, which is also decoded..."; Moffat); and that: the second 
value y determined as an integer and representative of a second location of a second 
skip entry in the inverted index (Page 14, 1 st and 2 nd paragraph, "...the inverted list 
becomes a sequence of blocks of three pairs each, with skips separating the blocks...", 
and ". . .a2 is the address of the first bit of the second skip pair, a3 is the address of the 
first bit of the third skip, and so on...", Moffat). It would have been obvious to one of 
ordinary skill in the art at the time the invention was made to incorporate the Moffat's 
teachings to the system Broder. Skilled artisan would have been motivated to do so, as 
suggested by Moffat (Page 15 and 16, paragraph 5, and 2; respectively, Moffat), to 
allow both disk space and query processing time to be reduced by having a faster 
inverted file index by fetching fewer terms and list. In addition, both of the references 
(Broder and Moffat) teach features that are directed to analogous art and they are 
directed to the same field of endeavor, such as, databases management systems, 
inverted lists, skipping, and indexing. This close relation between both of the references 
highly suggests an expectation of success. 

Furthermore, the combination of Broder in view of Moffat discloses: 
generating the inverted index for the collection of documents (Page 9, [0162], 
lines 1 - 3, Broder; and Page 1, Abstract, "...an internal index in each inverted list...", 
Moffat), the inverted index including an inverted list for each of the index terms (Page 
14, [0244], lines 2-3, Broder), each inverted list including at least one posting (Page 
14, [0244], lines 2-5, Broder) and, if the number of postings exceeds x (Page 14, 
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[0245], lines 7-10, Broder 2 ), further including the first skip entry after the xth posting 
(Page 14, [0245], lines 10-12, Broder; Page 13, Section: 4.1 Skipping, "...When k«p, 
faster performance is possible if synchronization points-additional locations at which 
decoding can commence-are introduced into the compressed inverted list...", Moffat) 
and one or more second skip entries thereafter at intervals of every yth posting (Page 
15, [0277], lines 1 - 6, Broder 3 ; Page 14, 1 st and 2 nd paragraph, '.'...the inverted list 
becomes a sequence of blocks of three pairs each, with skips separating the blocks...", 
and "...a2 is the address of the first bit of the second skip pair, a3 is the address of the 
first bit of the third skip, and so on...", Moffat); 
wherein: 

the at least one posting includes a document identifier identifying a 
document in the collection of documents (Page 14, [0244], lines 1-6, document unique 
identifier DID, Broder; and Page 3, Section 2: Document Database, 1 st paragraph, 
"Each document is known by a unique indetifier...", Moffat); 

the first and second skip entries including a document identifier that is 
included in a boundary posting of a block of postings immediately adjacent to the skip 
entry in the inverted list (Page 15, [0277], lines 1 - 6, Border 4 ; and Page 13, Section 4.1 
Skipping, 6 th paragraph, "... For example, suppose that p1 synchronisation points are 

2 Broder discloses that the method next(id) (page 14, [0245], lines 8-10). This method states that if there 
is not such document which DID >=id (where DID = number of documents, and id = number of posting), 
then the term iterator returns a special posting that is larger than all the existing DIDs. This implies that, if 
there is DID < id (the number of postings exceeds the number of documents entered), then the iterator 
returns the special posting mentioned above. 

3 The cursor is advanced to the position of k value. There is a skip at the kth value. 

4 Wherein doc_i corresponds to the document identifier claimed. In addition, the step of advancing the 
cursor to next docj k corresponds to the step of 
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allowed. Then the index into the inverted list contains p1 'document number, bit 
address' pairs, and can itself be stored as a compressed sequence of 'difference in 
document number, difference in bit address'..."; Moffat), where a block of postings 
includes postings having document identifiers ranging from a lower to an upper value 
and where a boundary posting is a posting having a document identifier of either the 
lower or the upper value (Page 16 and 18, [0286] and [0314], lines 5-9 and 3-9; 
respectively, Broder). 

Regarding Claims 2, and 38, the combination of Broder in view of Moffat 
discloses a method, wherein each posting further includes a position identifier 
identifying a position of the index term in the document (Page 14, [0244], lines 8-9, 
offsets of occurrences, Broder), and a frequency of the index term occurring in the 
document (Page 14 and 18, [0244] and [0316], lines 7 - 8 and 9 - 11, number of 
occurrences of the terms/ frequency; respectively, Broder). 

Regarding Claims 3, and 39, the combination of Broder in view of Moffat 
discloses a method, wherein the boundary posting includes a document identifier having 
the lower value in the range of values and the block of postings follow the first skip entry 
in the inverted list (Fig. 27, item 5, Page 16 and 17, [0301], lines 10-18, Broder 5 ; and 
Page 14, Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if synchronization 



5 According to Broder, the pivot DID is the smallest DID that might be a candidate (Page 17, [0301], lines 
17 - 18). In addition, Broder discloses that this pivot term goes through an "if statement which finds a 
first pivot term with UB (upper bound) greater than the threshold (Fig. 27, item 5). This implies that the 
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points-additional locations at which decoding can commence - are introduced into the 
compressed inverted list...", "...To access the compressed list to see if document d 
appears, the first skip is decoded to obtain the address a2 of the second skip, which is 
also decoded..."; Moffat). 

Regarding Claims 4, and 40, the combination of Broder in view of Moffat 
discloses a method, wherein the first skip entry further includes information to locate the 
next skip entry in the inverted list (Fig. 27, items 13 and 22, Broder 6 ; and Page 14, 
Section 4.1 : Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points- 
additional locations at which decoding can commence - are introduced into the 
compressed inverted list...", "...To access the compressed list to see if document d 
appears, the first skip is decoded to obtain the address a2 of the second skip, which is 
also decoded..."; Moffat). 

Regarding Claims 5, and 41 , the combination of Broder in view of Moffat 
discloses a method, wherein the boundary posting includes a document identifier having 
the higher value in the range of values and the block of postings precede the first skip 
entry in the inverted list (Page 17, [0302], lines 22 - 26, Broder 7 ; and Page 14, Section 
4.1: Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points-additional 

"next" method will return the smallest possible document number following the last one. In addition, the 
smallest document number corresponds to the lower value claimed. 

6 Posting[aterm] (located in the function next()) represents the information to locate next skip. 

7 The next() function iterates trough the list and selects from the preceding terms the term with the 
location greater (largest document number of documents) than the pivot location. Wherein the location 
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locations at which decoding can commence - are introduced into the compressed 
inverted list...", "...To access the compressed list to see if document of appears, the first 
skip is decoded to obtain the address a2 of the second skip, which is also decoded..."; 
Moffat). 

Regarding Claims 6, and 42, the combination of Broder in view of Moffat 
discloses a method, wherein the first skip entry further includes information to locate the 
next skip entry in the inverted list (Fig. 27, items 13 and 22, Broder 8 ; and Page 14, 
Section 4.1 : Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points- 
additional locations at which decoding can commence - are introduced into the 
compressed inverted list...", "...To access the compressed list to see if document d 
appears, the first skip is decoded to obtain the address a2 of the second skip, which is 
also decoded..."; Moffat). 

Regarding Claims 9, and 45, the combination of Broder in view of Moffat 
discloses a method, wherein the collection of one or more documents includes one or 
more binary files, data tables, source code files, text documents or combinations thereof 
(Page 9, [0158], lines 1-13, Broder). 



greater (largest document number of documents) than the pivot location corresponds to the higher value 
as claimed. 

8 Posting[aterm] (located in the function nextQ) represents the information to locate next skip. 
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Regarding Claims 10, and 46, the combination of Broder in view of Moffat 
discloses a method including all the limitations of claim 1 , and 37, as disclosed above, 
further comprising: 

compressing the inverted index (Page 15, [0273], lines 1 - 3, zipping, Broder). 

Regarding Claims 1 1 , and 47, the combination of Broder in view of Moffat 
discloses a method, wherein substantially all of the index terms occur in x documents or 
fewer (Page 15, [0257], lines 14-16, Broder). 

Regarding Claims 12, and 48, the combination of Broder in view of Moffat 
discloses a method, wherein at least approximately 80% of the index terms occur in x 
documents (Page 17, [0307], lines 1 - 3, top n results, Broder). 

Regarding Claims 13, and 49, the combination of Broder in view of Moffat 
discloses a method, wherein for each inverted list, if the number of postings exceeds x, 
further including a skip entry before the first posting in the inverted list (Page 15, [0257], 
lines 12-20, the result is inserted, Broder; and Page 14, Section 4.1: Skipping, 1 st - 3 rd 
paragraph in that page, "if synchronization points-additional locations at which decoding 
can commence - are introduced into the compressed inverted list...", "...To access the 
compressed list to see if document d appears, the first skip is decoded to obtain the 
address a2 of the second skip, which is also decoded..."; Moffat). 
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Regarding Claims 14, and 50, the combination of Broder in view of Moffat 
discloses a method, wherein for each inverted list, if the number of postings exceeds x 
(Page 14, [0245], lines 7-10, Broder 9 ), further including a skip entry after the last 
posting in the inverted list (Page 14, [0245], lines 10-12, Broder; and Page 14, Section 
4.1 : Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points-additional 
locations at which decoding can commence - are introduced into the compressed 
inverted list. . .", ". . To access the compressed list to see if document d appears, the first 
skip is decoded to obtain the address a2 of the second skip, which is also decoded..."; 
Moffat). 

Regarding Claims 16, and 52, the combination of Broder in view of Moffat 
discloses a method, wherein each posting further includes, a position identifier 
identifying a position of the index term in the document (Page 14, [0244], lines 8-9, 
offsets of occurrences, Broder), and a frequency of the index term occurring in the 
document (Page 14 and 18, [0244] and [0316], lines 7 - 8 and 9 - 1 1 , number of 
occurrences of the terms/ frequency; respectively, Broder). 

Regarding Claims 1 7, and 53, the combination of Broder in view of Moffat 
discloses a method, wherein wherein the boundary posting includes a document 
identifier having the lower value in the range of values and the block of postings follow 

9 Broder discloses that the method next(id) (page 14, [0245], lines 8-10). This method states that if there 
is not such document which DID >=id (where DID = number of documents, and id = number of posting), 
then the term iterator returns a special posting that is larger than all the existing DIDs. This implies that, if 
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the first skip entry in the inverted list (Fig. 27, item 5, Page 16 and 17, [0301], lines 10 - 
18, Broder 10 ; and Page 14, Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if 
synchronization points-additional locations at which decoding can commence - are 
introduced into the compressed inverted list...", "...To access the compressed list to see 
if document d appears, the first skip is decoded to obtain the address a2 of the second 
skip, which is also decoded..."; Moffat). 

Regarding Claims 18, and 54, the combination of Broder in view of Moffat 
discloses a method, wherein the first skip entry further includes information to locate the 
next skip entry in the inverted list (Fig. 27, items 13 and 22, Broder 11 and Page 14, 
Section 4.1 : Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points- 
additional locations at which decoding can commence - are introduced into the 
compressed inverted list...", "...To access the compressed list to see if document d 
appears, the first skip is decoded to obtain the address a2 of the second skip, which is 
also decoded..."; Moffat). 

Regarding Claims 19, and 55, the combination of Broder in view of Moffat 
discloses a method, wherein the boundary posting includes a document identifier having 



there is DID < id (the number of postings exceeds the number of documents entered), then the iterator 
returns the special posting mentioned above. 

10 According to Broder, the pivot DID is the smallest DID that might be a candidate (Page 17, [0301], lines 
17 - 18). In addition, Broder discloses that this pivot term goes through an "if statement which finds a 
first pivot term with UB (upper bound) greater than the threshold (Fig. 27, item 5). This implies that the 
"next" method will return the smallest possible document number following the last one. In addition, the 
smallest document number corresponds to the lower value claimed. 

11 Posting[aterm] (located in the function nextQ) represents the information to locate next skip. 
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the higher value in the range of values and the block of postings precede the first skip 
entry in the inverted list (Page 17, [0302], lines 22 - 26, Broder 12 ; and Page 14, Section 
4.1 : Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points-additional 
locations at which decoding can commence - are introduced into the compressed 
inverted list. . .", ". . To access the compressed list to see if document d appears, the first 
skip is decoded to obtain the address a2 of the second skip, which is also decoded..."; 
Moffat). 

Regarding Claims 20, and 56, the combination of Broder in view of Moffat 
discloses a method, wherein the first skip entry further includes information to locate the 
next skip entry in the inverted list (Fig. 27, items 13 and 22, Broder 13 ; and Page 14, 
Section 4.1 : Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points- 
additional locations at which decoding can commence - are introduced into the 
compressed inverted list...", "...To access the compressed list to see if document d 
appears, the first skip is decoded to obtain the address a2 of the second skip, which is 
also decoded..."; Moffat). 

Regarding Claims 21 , and 57, the combination of Broder in view of Moffat 
discloses a method, wherein substantially all of the index terms occur in x documents or 
fewer (Page 15, [0257], lines 14-16, Broder). 

12 The next() function iterates trough the list and selects from the preceding terms the term with the 
location greater (largest document number of documents) than the pivot location. Wherein the location 
greater (largest document number of documents) than the pivot location corresponds to the higher value 
as claimed. 
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Regarding Claims 22, and 58, the combination of Broder in view of Moffat 
discloses a method, wherein approximately 80 to 90% of the index terms occur in x 
documents or fewer (Page 1 7, [0307], lines 1 - 3, top n results, Broder). 

Regarding Claims 23, and 59, the combination of Broder in view of Moffat 
discloses a method, wherein for each inverted list, if the number of postings exceeds x, 
further including a skip entry before the first posting in the inverted list (Page 15, [0257], 
lines 12 - 20, the result is inserted, Broder; and Page 14, Section 4.1: Skipping, 1 st - 3 rd 
paragraph in that page, "if synchronization points-additional locations at which decoding 
can commence - are introduced into the compressed inverted list...", "...To access the 
compressed list to see if document d appears, the first skip is decoded to obtain the 
address a2 of the second skip, which is also decoded..."; Moffat). 

Regarding Claims 24, and 60, the combination of Broder in view of Moffat 
discloses a method, wherein for each inverted list, if the number of postings exceeds x 
(Page 14, [0245], lines 7-10, Broder 14 ), further including a skip entry after the last 
posting in the inverted list (Page 14, [0245], lines 10-12, Broder; and Page 14, Section 
4.1: Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points-additional 

13 Posting[aterm] (located in the function next()) represents the information to locate next skip. 
14 Broder discloses that the method next(id) (page 14, [0245], lines 8-10). This method states that if 
there is not such document which DID >=id (where DID = number of documents, and id = number of 
posting), then the term iterator returns a special posting that is larger than all the existing DIDs. This 
implies that, if there is DID < id (the number of postings exceeds the number of documents entered), then 
the iterator returns the special posting mentioned above. 
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locations at which decoding can commence - are introduced into the compressed 
inverted list. . .", ". . .To access the compressed list to see if document of appears, the first 
skip is decoded to obtain the address a2 of the second skip, which is also decoded..."; 
Moffat). 

Regarding Claim 25, the combination of Broder in view of Moffat discloses an 
inverted index for a collection of documents (Page 9, [0162], lines 1-3, Broder), each 
document comprising one or more index terms (Page 14, [0244], lines 3-5, Broder), 
the inverted index comprising: 

an inverted list for each index term in the collection of documents (Page 14, 
[0244], lines 2-3, Broder); and 

one or more inverted lists including a quantity of postings (Page 14, [0244], lines 
2-5, Broder) that exceeds a value x (Page 14, [0245], lines 7-10, Broder), a skip 
entry after the xth posting (Page 14, [0245], lines 10-12, Broder), and one or more 
additional skip entries thereafter at intervals of every yth posting (Page 1 5, [0277], lines 
1 - 6, Broder 15 ; and Page 14, Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if 
synchronization points-additional locations at which decoding can commence - are 
introduced into the compressed inverted list...", "...To access the compressed list to see 
if document d appears, the first skip is decoded to obtain the address a2 of the second 
skip, which is also decoded..."; Moffat), where the value x is determined as an integer 
corresponding to a quantity of documents including at least a majority of the index terms 



15 The cursor is advanced to the position of k value. There is a skip at the kth value. 
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(Page 17 and 18, [0307] and [0314], lines 1 - 3 and 3 - 9; respectively, Broder 16 ) and 
the value y is determined as an integer and does not exceed the value x (Page 1 5, 
[0277], lines 1 - 2, Broder 17 ); 
wherein: 

a posting includes a document identifier identifying a document in the 
collection of documents (Page 14, [0244], lines 1 - 6, document unique identifier DID, 
Broder); 

a skip entry includes a document identifier that is included in a boundary 
posting of a block of postings immediately adjacent to the skip entry in the inverted list 
(Page 15, [0277], lines 1 - 6, Border 18 ; and Page 14, Section 4.1: Skipping, 1 st - 3 rd 
paragraph in that page, "if synchronization points-additional locations at which decoding 
can commence - are introduced into the compressed inverted list...", "...To access the 
compressed list to see if document d appears, the first skip is decoded to obtain the 
address a2 of the second skip, which is also decoded..."; Moffat), where a block of 
postings includes postings having document identifiers ranging from a lower to an upper 
value and where a boundary posting is a posting having a document identifier of either 
the lower or the upper value (Page 1 6 and 1 8, [0286] and [031 4], lines 5 - 9 and 3 - 9; 
respectively, Broder). 

16 Wherein the top "n" documents corresponds to the documents with the majority of index terms claimed; 
specifically "n" corresponds to the value x claimed. The scoring procedure utilized by Broder (as disclosed 
in detailed in Page 16, [0286], lines 5-6, Broder) including terms associations with upper bounds on its 
maximal contribution to documents scores; wherein the maximal contributing of index terms corresponds 
to the majority of the index terms as claimed. Additionally, the top n scoring documents corresponds to a 
quantity of documents as claimed. 

1 Y would be the smallest index k. And x would be the top n documents (the size of the heap) (Page 15, 
(0257], lines 15-16). 
Wherein doc_i corresponds to the document identifier claimed. 
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Regarding Claim 26, the combination of Broder in view of Moffat discloses an 
inverted index, wherein each posting further includes position identifier identifying a 
position of the index term in the document (Page 14, [0244], lines 8-9, offsets of 
occurrences, Broder), and a frequency of the index term occurring in the document 
(Page 14 and 18, [0244] and [0316], lines 7-8 and 9-11, number of occurrences of 
the terms/ frequency; respectively, Broder). 

Regarding Claim 27, the combination of Broder in view of Moffat discloses an 
inverted index, wherein the boundary posting includes a document identifier having the 
lower value in the range of values and the block of postings follow the first skip entry in 
the inverted list (Fig. 27, item 5, Page 16 and 17, [0301], lines 10-18, Broder 19 ; and 
Page 14, Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if synchronization 
points-additional locations at which decoding can commence - are introduced into the 
compressed inverted list...", "...To access the compressed list to see if document of 
appears, the first skip is decoded to obtain the address a2 of the second skip, which is 
also decoded..."; Moffat). 

Regarding Claim 28, the combination of Broder in view of Moffat discloses an 
inverted index, wherein the first skip entry further includes information to locate the next 

19 According to Broder, the pivot DID is the smallest DID that might be a candidate (Page 17, [0301], lines 
17 - 18). In addition, Broder discloses that this pivot term goes through an "if statement which finds a 
first pivot term with UB (upper bound) greater than the threshold (Fig. 27, item 5). This implies that the 
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skip entry in the inverted list (Fig. 27, items 13 and 22, Broder 20 ; and Page 14, Section 
4.1 : Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points-additional 
locations at which decoding can commence - are introduced into the compressed 
inverted list...", "...To access the compressed list to see if document d appears, the first 
skip is decoded to obtain the address a2 of the second skip, which is also decoded..."; 
Moffat). 

Regarding Claim 29, the combination of Broder in view of Moffat discloses an 
inverted index, wherein the boundary posting includes a document identifier having the 
higher value in the range of values and the block of postings precede the first skip entry 
in the inverted list (Page 17, [0302], lines 22 - 26, Broder 21 ; and Page 14, Section 4.1: 
Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points-additional locations 
at which decoding can commence - are introduced into the compressed inverted list...", 
"...To access the compressed list to see if document d appears, the first skip is decoded 
to obtain the address a2 of the second skip, which is also decoded..."; Moffat). 

Regarding Claim 30, the combination of Broder in view of Moffat discloses an 
inverted index, as disclosed above, wherein the first skip entry further includes 
information to locate the next skip entry in the inverted list (Fig. 27, items 13 and 22, 

"next" method will return the smallest possible document number following the last one. In addition, the 
smallest document number corresponds to the lower value claimed. 

20 Posting[aterm] (located in the function nextQ) represents the information to locate next skip. 

21 The next() function iterates trough the list and selects from the preceding terms the term with the 
location greater (largest document number of documents) than the pivot location. Wherein the location 
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Broder 22 ; and Page 14, Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if 
synchronization points-additional locations at which decoding can commence - are 
introduced into the compressed inverted list...", "...To access the compressed list to see 
if document d appears, the first skip is decoded to obtain the address a2 of the second 
skip, which is also decoded..."; Moffat). 

Regarding Claim 32, the combination of Broder in view of Moffat discloses an 
inverted index, wherein substantially all of the index terms occur in x documents or 
fewer (Page 1 5, [0257], lines 14-16, Broder). 

Regarding Claim 33, the combination of Broder in view of Moffat discloses an 
inverted index, wherein approximately 80% of the index terms occur in x documents 
(Page 17, [0307], lines 1 - 3, top n results, Broder). 

Regarding Claim 34, the combination of Broder in view of Moffat discloses an 
inverted index, wherein the collection of one or more documents includes one or more 
binary files, data tables, source code files, text documents or combinations thereof 
(Page 9, [0158], lines 1-13, Broder). 

Regarding Claim 35, the combination of Broder in view of Moffat discloses an 
inverted index, wherein the one or more inverted lists further include a skip entry before 



greater (largest document number of documents) than the pivot location corresponds to the higher value 
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the first posting in the inverted list (Page 15, [0257], lines 12-20, the result is inserted, 
Broder; and Page 14, Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if 
synchronization points-additional locations at which decoding can commence - are 
introduced into the compressed inverted list...", "...To access the compressed list to see 
if document d appears, the first skip is decoded to obtain the address a2 of the second 
skip, which is also decoded..."; Moffat). 

Regarding Claim 36, the combination of Broder in view of Moffat discloses an 
inverted index, wherein the one or more inverted lists further include a skip entry after 
the last posting in the inverted list (Page 14, [0245], lines 10-12, Broder; and Page 14, 
Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if synchronization points- 
additional locations at which decoding can commence - are introduced into the 
compressed inverted list...", "...To access the compressed list to see if document d 
appears, the first skip is decoded to obtain the address a2 of the second skip, which is 
also decoded..."; Moffat). 

8. Claim 8, 1 5, 31 , 44, and 51 are rejected under 35 U.S.C. 1 03(a) as being 
unpatentable over Broder et al. (Broder hereinafter) (US Patent Application Pub No. 
2004/0243560 A1 , filed: May 30, 2003), in view of Moffat et al. (Moffat hereinafter) 
(Non-Patent Literature: "Self-Indexing Inverted Files for Fast Text Retrieval"; Alistar 
Moffat, and Justin Zobel; February 1994, 1994 Australian Database Conference, and 



as claimed. 
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1994 IEEE Conference on Data Engineering), and further in view of Young et al. (Young 
hereinafter) (US Patent No. 5,838,950, issued: November 17, 1998). 

Regarding Claim 8, and 44, the combination of Broder in view of Moffat discloses 
all the limitations as discussed above including: wherein x is selected and y is selected, 
wherein y is not selected to have the same value as x (Page 15, [0277], lines 1 - 2, k be 
the smallest index, Broder; and Page 13, Section 4.1 Skipping, paragraph 1, 
"...suppose that p1 synchronisation points are allowed. The index into the inverted list 
contains p1 "document number, bit address" pairs, and can itself be stored as a 
compressed sequence of "difference in document number, difference in bit address run 
lengths...", Moffat). However, Broder is silent with respect to a first range of 256 to 512 
and a second range of 128 to 256. On the other hand, Young discloses a system 
including bits and bytes including ranges of 256 to 512, and of 128 to 256 (Col. 140, 
lines 30 - 43, Young). It would have been obvious to one of ordinary skill in the art at 
the time the invention was made to incorporate the Young's teachings to the system of 
Broder, by selecting the values of x and y according to ranges. Skilled artisan would 
have been motivated to select the values of x and y from ranges 256 to 512, and of 128 
to 256, as suggested by Young (Col. 4, lines 1 - 5 and 38 - 43,Young), to provide a 
higher speed system, "leading" address byte offset to a specified bit boundary, and 
"trailing" address byte offsets. In addition, the applied references (Broder, Moffat, and 
Young) teach features that are directed to analogous art and they are directed to the 



Posting[aterm] (located in the function nextQ) represents the information to locate next skip. 
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same field of endeavor of database management system, such as, data manipulation, 
and indexing, and values representative of locations. This relation between the applied 
references highly suggests an expectation of success. 

Regarding Claim 1 5, and 51 , the combination of Broder in view of Moffat and 
further in view of Young discloses an article comprising a machine-readable medium 
storing instructions operable to cause one or more machines to perform operations 
comprising: 

receiving a collection of documents, each document comprising one or more 
index terms (Page 14, [0244], lines 3-5, Broder); 

determining a first value x representative of first location of a first skip entry in an 
inverted index (Page 14, Section 4.1: Skipping, 1 st - 3 rd paragraph in that page, "if 
synchronization points-additional locations at which decoding can commence - are 
introduced into the compressed inverted list...", "...To access the compressed list to see 
if document d appears, the first skip is decoded to obtain the address a2 of the second 
skip, which is also decoded..."; Moffat), wherein at least a majority of the index terms 
occur in x documents (Page 17 and 18, [0307] and [0314], lines 1-3 and 3-9; 
respectively, Broder 23 ) and x is an integer (Page 10, [0182], lines 6-8, Broder) in a first 
range of 256 to 512 (Col. 140, lines 30 - 43, Young); 



Wherein the top "n" documents corresponds to the documents with the majority of index terms claimed; 
specifically "n" corresponds to the value x claimed. The scoring procedure utilized by Broder (as disclosed 
in detailed in Page 16, [0286], lines 5-6, Broder) including terms associations with upper bounds on its 
maximal contribution to documents scores; wherein the maximal contributing of index terms corresponds 
to the majority of the index terms as claimed. 
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determining a second value y representative of second location of a second skip 
entry in an inverted index (Page 14, 1 st and 2 nd paragraph, "...the inverted list becomes 
a sequence of blocks of three pairs each, with skips separating the blocks...", and "...a2 
is the address of the first bit of the second skip pair, a3 is the address of the first bit of 
the third skip, and so on...", Moffat), wherein y does not exceed the value of x (Page 15, 
[0277], lines 1 - 2, k be the smallest index, Broder) and is an integer (Page 10, [0182], 
lines 6-8, Broder) in a second range of 128 to 256 (Col. 140, lines 30 - 43, Young); 

generating the inverted index for the collection of documents (Page 9, [0162], 
lines 1-3, Broder; and Page 1, Abstract, "...an internal index in each inverted list...", 
Moffat), the inverted index including an inverted list for each of the index terms (Page 
14, [0244], lines 2-3, Broder), each inverted list including at least one posting (Page 
14, [0244], lines 2-5, Broder) and, if the number of postings exceeds x (Page 14, 
[0245], lines 7-10, Broder 24 ), further including the first skip entry after the xth posting 
(Page 14, [0245], lines 10-12, Broder; and Page 13, Section: 4.1 Skipping, "...When 
k«p, faster performance is possible if synchronization points-additional locations at 
which decoding can commence-are introduced into the compressed inverted list...", 
Moffat) and one or more second skip entries thereafter at intervals of every yth posting 
(Page 15, [0277], lines 1 - 6, Broder 25 ; and Page 14, 1 st and 2 nd paragraph, "...the 
inverted list becomes a sequence of blocks of three pairs each, with skips separating 

24 Broder discloses that the method next(id) (page 14, [0245], lines 8-10). This method states that if 
there is not such document which DID >=id (where DID = number of documents, and id = number of 
posting), then the term iterator returns a special posting that is larger than all the existing DIDs. This 
implies that, if there is DID < id (the number of postings exceeds the number of documents entered), then 
the iterator returns the special posting mentioned above. 

25 The cursor is advanced to the position of k value. There is a skip entry at the kth value. 
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the blocks. . .", and ". . .a2 is the address of the first bit of the second skip pair, a3 is the 
address of the first bit of the third skip, and so on. . .", Moffat), 
wherein: 

the at least one posting includes a document identifier identifying a 
document in the collection of documents (Page 14, [0244], lines 1-6, document unique 
identifier DID, Broder; and and Page 3, Section 2: Document Database, 1 st paragraph, 
"Each document is known by a unique indetifier...", Moffat); 

the first and second skip entries include a document identifier that is 
included in a boundary posting of a block of postings immediately adjacent to the skip 
entry in the inverted list (Page 15, [0277], lines 1 - 6, Border 26 ; and Page 13, Section 
4.1 Skipping, 6 th paragraph, "... For example, suppose that p1 synchronisation points 
are allowed. Then the index into the inverted list contains p1 'document number, bit 
address' pairs, and can itself be stored as a compressed sequence of 'difference in 
document number, difference in bit address'..."; Moffat), where a block of postings 
includes postings having document identifiers ranging from a lower to an upper value 
and where a boundary posting is a posting having a document identifier of either the 
lower or the upper value (Page 16 and 18, [0286] and [0314], lines 5 -9 and 3-9; 
respectively, Broder). 



Wherein doc_i corresponds to the document identifier claimed. 



Application/Control Number: 1 0/601 ,258 Page 24 

Art Unit: 2162 

Additional limitations of claims 15 and 51, not included above, have been 
rejected under the same criteria as claims 8, 31 , and 44 (See claims 8, 31 , and 44 - 
Claim Rejections - 35 USC § 112- listed on this office action). 

Regarding Claim 31 , the combination of Broder in view of Moffat and further in 
view of Young discloses an inverted index of claim 25, wherein x is selected from a first 
range of 256 to 512 and y is selected from a second range of 128 to 256 (Col. 140, lines 
30 - 43, Young), wherein y cannot be selected to have the same value as x (Page 15, 
[0277], lines 1 - 2, k be the smallest index, Broder; and Page 13, Section 4.1 Skipping, 
paragraph 1, "...suppose that p1 synchronisation points are allowed. The the index into 
the inverted list contains p1 "document number, bit address" pairs, and can itself be 
stored as a compressed sequence of "difference in document number, difference in bit 
address run lengths...", Moffat). 
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Response to Arguments 

1 . In response to applicant's argument that the references fail to show certain 
features of applicant's invention, it is noted that the features upon which applicant relies 
(i.e., "...using skip values..."; and "...inserts a first skip entry based on the determined 
frits value x and subsequent skip entries based on the ...") are not recited in the 
rejected claim(s). Although the claims are interpreted in light of the specification, 
limitations from the specification are not read into the claims. See In re Van Geuns, 988 
F.2d 1181, 26 USPQ2d 1057 (Fed. Cir. 1993). 
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Conclusion 

1 . The prior art made of record and not relied upon is considered pertinent to 
applicant's disclosure. 



2. THIS ACTION IS MADE FINAL Applicant is reminded of the extension of time 
policy as set forth in 37 CFR 1 . 1 36(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the mailing date of this final action. 
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Prior art Made of Record 

1 . Broder et al. (US Patent Application Pub. No. 2004/0243560 A1 ) discloses a 
system, method and computer program product for performing unstructured information 
management and automatic text analysis, including an annotation inverted file system 
facilitating indexing and searching. 

2. Antoshenkov (US Patent No. 6,439,783 B1) discloses a range-based query 
optimizer. 

3. Huynh et al. (US Patent No. 5,539,899) discloses a system and method for 
handling a segmented program in a memory for a multitasking data processing system 
utilizing paged virtual storage. 

4. Young et al. (US Patent No. 5,838,950) discloses a method of operation of a host 
adapter integrated circuit. 

5. Moffat et al. (Moffat hereinafter) (Non-Patent Literature: "Self-Indexing Inverted 
Files for Fast Text Retrieval"; Alistar Moffat, and Justin Zobel; February 1994, 1994 
Australian Database Conference, and 1994 IEEE Conference on Data Engineering) 
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Points Of Contact 



Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to Giovanna Colan whose telephone number is (571 ) 272- 
2752. The examiner can normally be reached on 8:30 am - 5:00 pm. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, John Breene can be reached on (571 ) 272-4107. The fax phone number for 
the organization where this application or proceeding is assigned is 571-273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

Giovanna Colan 
Examiner 
Art Unit 2162 
May 8, 2007 
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